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q 1 ! ABSTRACT 

U: 

We provide a formula for extracting the value of the runs of the linear matter 
fluctuations on a scale R directly from redshift survey data. It allows us to constrain 
Oh. the real-space amplitude of <7r without requiring any modeling of the nature and 

q ' power spectrum of the matter distribution. Furthermore, the formalism is completely 

S-^ , insensitive to the character of the bias function, namely its eventual scale or non- 

' linear dependence. By contrasting measurements of <jr with predictions from linear 

perturbation theory, one can test for eventual departures from the standard description 
of gravity on large cosmological scales. 
C^l , The proposed estimator exploits the information contained in the reduced one- 

point and two-point cumulant moments of the matter and galaxy density fields, and 
it can be applied on cosmic scales where linear and semi-linear perturbative approxi- 
' mations of the evolution of matter overdensities offer a satisfactory description of the 

. full underlying theory. We implement the test with N-body simulations to quantify 

' potential systematics and successfully show that we are able to recover the present day 

£^ , value of er 8 'hidden' in the simulations. We also design a consistency check to gauge 

CN ' the soundness of the results inferred when the formalism is applied to real (as opposed 

to simulated) data. We expect that this approach will provide a sensitive probe of the 
J> ' clustering of matter when applied to future large redshift survey such as BigBOSS 

and EUCLID. 
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1 INTRODUCTION 



According to a widely accepted paradigm, cosmic structures grew from tiny dark matter density fluctuations present in the 
otherwise homogeneous and rapidly expanding early universe. The standard version of the model incorporates the assumption 
that these primordial and Gaussian distributed fluctuations are amplified by gravity eventually turning into the rich structure 
that surrounds us today. This picture in which gravity, as described by general relativity, is the engine driving cosmic growth 
is generally referred to as the gravitational instability paradigm (GIP). However plausible it may seem, it is important to test 
its validity. 

In the local universe the GIP paradigm has been shown to make sense of a vast amount of independent observations on 
differ ent spatial scales, from galaxies to superclusters of galaxies (e.g. Peacock et al. 2001 ; Tegmark et al. 20061 ; Reyes et al. 



20ld). Deep galaxy surveys now allow us to test whether the predictions of this assumption are also valid at earlier epochs 



i Guzzo et al.ll2008l ; iBlake et al.ll201ir ). In particul ar, they allow us to asses weather GR is the correct theory describing the 



action of gravity on large cosmological scales (e.g. I Jain fc Zhandliool IUzanll2009l ; lAcquaviva fc Gawiserl 2010) 
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Indeed, modifications to GR have been proposed as alternatives to explain observations showing that the universe is under- 

going accelerated expansion fPvali. Gabadadze fe Porratilboool ; Capozziello. Cardone fc Troisil2005 ; Amendola. Polarski fc Tsuiikawa 



mg 
37; 



Bertacca et al 



2007; iBuzzi. Marinoni fc Colafra ncesco 200S). Non standard gravitational models have also bee n invoked as an alterna- 
tive to dark matter 1 Milgroml 19831 : Bekenstein 20041 ) or to its standard physical characterization I Piazza fc Marinonil 20031 : 



20081 ) . Since these modified gravity theories are specifically tuned to explain the uniform cosmic expansion 
history, a possible way to test their reliability is to an alyze the inhomogeneous section of the universe, i.e. cosmological 



perturbations of matter 1 Linder 20051 : Zhang et al. 2007 ). 

It is a non-trivial task, compounded by our poor knowledge of the biasing mechanism, to trace the global matter dis- 
tribution by its luminous subcomponent. An operational definition of bias, that is useful for investigating the hierarchical 
clustering of matter, is conventionally given in terms of continuous density fields, provided that the galaxy distribution is 
smoothed on scales R large enough compared to those where non-gravitational physics operates. One can thus expand the 
dimensionless density fluctuations of galaxies <5 g ,ft(x) at position x in Taylor series of the underlying mass overdensity Sr at 
the same point 



(1) 



where bi are the bias coefficients. It has been shown that, in this large scale limit, such a local transformation preserves the 
hierarchical properties of one-point matter statistics ( Fry fc Gaztanagal 1993h . 

The problem of interfacing theory (mass) with observations (light) stems from the fact that the equations that give 
access to the value of fundamental gravitational quantities (such as the rms of linear matter fluctuations on a given scale R 
(<tr = v Wr)) or t ne growth rate of linear perturbations / = din 5/ In a, where a is the cosmic scale factor) are also the very 
same equations that allows us to extract the value of the bias parameters bi. Since the relevant physical and cosmological 
quantities are generally degenerate with the bias parameters, it is not immediately obvious how to fix their values. Because 
of this, the traditional approach consists in assuming that gravitational and cosmological parameters are independently 

l2002h . 



Lahavetal 



determined to fix the amplitude of the bias coefficients bi (e.g. 

The viability of the opposite route, that is investigating the coherence of the phy sical model given an a-prior i knowledge of 
the bias function, has been systematically explored only recently. Zhang et al. 1 20071 ) and Song fc Percival ( 20091 ) . for example, 



have proposed to assess the soundness of GR by constructing s tatistical ind i cators that are in principle insensitive to the linear 
biasing parameter, i.e. the lower order term in equation (fljl. iGuzzo et al. ( 20081 ) . on the contrary, tested GR by comparing 
the observed and predicted growth rate of matter fluctuations using data from deep redshift surveys. To fulfill their goal 
they adopted the value of the linear bia sing parameter provi ded by independent observations, such as the leve l of anisotropy 
in the Cosmic Microwave B ackground ( Komatsu et al.ll201ll ) or the mean number density of galaxy clusters ( Borgani et al 



2001: ISchuecker et al.ll2003l ). Both the se strategies suffer from the fact that there ar e now convincing evidences about the non - 
linear character of the bias function ( Marinoni et al. 20051 : Gaztafiaga et al. 20051 : Marinoni et al. 20081 : Kovac et al. 20091 ) . 
These testing scheme is also far from being economic, requiring data from multiple and independent probes of the large scale 
structure, redshift surveys, imaging surveys, CMB observations. 

An orthogonal, more general approach, aims at extracting from redshift surveys both the value of or and the bias 
parameters bi . Several authors ha ve shown that, if the initial perturbations are G aussian and if the shape of third order statistics 



such as the reduced skewness 53 ( Gaztana"gal 1994 ; Gaztafiaga fc Frieman 1994 ), the bispectrum ( Frvlll994 



Fcldman et al. 



2001 



Verde et al 



Scoccimarrc 



1998 



20021 ) or the 3-point correlation function ijGaztanaga et al 2005^ Gaztanaga"X*l5coccffnarrol 



2005l : |Pan fc Szapudilbool )" is correctly described by results of the weakly non-inear perturbation theory, then one can fix the 
amplitude of bi up to order 2 in a way that is independent from the overall amplitude of clustering (e.g. as) and depends only 
on the shape of the linear power spectrum. 

In this paper we further explore the potentiality of this ap proach. We exploit the information encoded in a different kind 
of third order statistics, the reduced two-point correlator C12 ( Bernardeau 1996). Following a suggestion of Szapudi (1998) 
we work out the explicit expressions for the bias coefficients up to order 4 as a function of C12 and of the reduced one-point 
cumulant Si (up to order 5). Using these results we construct the central formula of this paper (cfr. eq. ([45])), i.e. an estimator 
of or that is independent from any assumption about the linear power spectrum of matter. As a result, by contrasting the 
redshift evolution of or with predictions of perturbation theory at linear order we can place constraints on the reliability of 
the GIP paradigm and in particular on the viability of general relativity on large cosmological scales. Our formalism relies 
upon theoretical assumptions and approximations that we have tested using cosmological N-body simulations. In this way we 
can assess the overall coherence of the method and the impact of potential systematics. 

The paper is organized as follows: in §2, we introduce the relevant definitions and relations that provide the standard 
framework for understanding the clustering of matter in the weakly non-linear regime. We work out the relationships between 
the two-point reduced correlators of galaxies and matter in §3, and in §4 we show how to use them in order to express the 
bias coefficients bi (up to order 4) as a function of observable quantities. The central achievement of this paper is presented 
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in §5 where we detail the computational scheme that allows to extract both the amplitude and redshift scaling of <jr in real 
space. In section §6 we work out the formalism for mapping continuum variables into discrete observables and we show how 
to implement in practice the proposed testing strategy using simulations. We also design a strategy to test the coherence of 
the results. Conclusions and prospects are drawn in §7. 



2 HIGHER ORDER CORRELATIONS AS A MEASURE OF THE LARGE-SCALE DISTRIBUTION OF 
MATTER 

In this section we give the theoretical background for analyzing the clustering of matter on large cosmic scales as well as the 
mathematical framework to the model presented in Section §3. We first introduce our notation and briefly overview the current 
understanding of the hierarchical clustering of matter in the universe. We then discuss the high order statistical descriptors 
of the density perturbation field and display some fundamental results about their linear and weakly non-linear evolution. We 
finally show how these results are changed when the re levant statistics ar e samp led with a given finite resolution. A thorough 



treatment of the subject can be found in the review of iBernardeau et al.l 1 200: 



mpli 

3). 



2.1 Joint A'-point cumulant moments of the continuous fluctuation field 

Be A(x) the intensity function describing the overall mass distributio n in the universe, i.e. a stochastic field representing the 



density of matter at a given position x (e.e. iMartinez fc Saarll2002D . We consider A(x) as a particular (homogeneous and 
isotropic) realization drawn from an ensemble £ and indicate with .F[A(x)] its probability density functional (PDF). 

Since the intensity field is positively defined, J r [A(x)] is by definition non Gaussian. Its complete characterization requires 
the knowledge of the entire (formally infinite) hierarchy of the if-point expectation values 



(A(x 1 )...A(x Jf )>=, sj{xi) ^ j{xk) 



(2) 

J(x 1 ) = ... = .7(x K )=0 



where 



M X [J] = f 2?A(x)^[A(x)]e , / tixJ(x)A(x) = (e , / Mx)AW ) 



is the K -point moment generating functional, an d where PA(x) represents a suitable mea sure introduced in £ such that the 
total probability turns out to be normalized to 1 ( Matarrese. Lucchin fc Bonomettol 19861 ). 



The cosmological principle guarantees the existence of a non-zero value for the expected value of the A field. It is thus 
convenient to characterize inhomogeneities in the matter distribution in terms of the local dimensionless density contrast 

We assume that the ensemble average over £ is equivalent to averaging a particular realization over different spatial positions. 
If this operation is done on a fair sample then (5(x)) = 0. 

An useful tool for probing the large-scale cosmological structure, because of their relatively simple connection to both 
theory and observations, are the joint if -point cumulant moments of order n = (711,712 tik) of the cosmic overdensity field 

K» 1 ,...,n Jf (*l,...,XlO = («5 ni (xi)-^ K (xx)) c , (4) 

and, in particular, the irreducible if-point autocorrelation functions 

Ki,...,i(xi, xjc) = (<5(xi) ... 5(xj<-)) c (5) 

generally denoted as £k and shortly called correlation functions. By definition, the generating functional of the Appoint 
cumulant moment of the overdensity field is the logarithm of the moment generating functional 

C[J]=\nMs[J]. (6) 

The advantage of computing connected averages instead of statistical averages (^ ni ,...,n K = (xi) ... 8 nK (xjf)J) is 
that cumulants are zero if the random variables representing the value of the stochastic field at different spatial positions 
are statistically independent. Conversely, a cumulant is not zero if and only if the random variables in it are statistically 
"connected". A specific consequence of this properties is that K ni ,...,n.K as any subset of positions x; are displaced to 
infinite separation. Finally, cumulants can be explicitly represented in terms of only the lower order moments, that is as a 
function of fim 1 ...m K (xi, xjf) with ^ rm ^ rij. This follows from taking successive functional derivatives of the generating 
functional C[J] 
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S N C[J] 



l-Til .....Til, 



5J ni ( Xl )...5J nx (x fc ) 



J(xi)=...=J(x K )=0 



(7) 



(8) 



where AT = J^ — i n »- The result can be formally written as (jMeeronlll957h 

j 1=1 7i>™ij i=l K 1 ) 

In this last equation, and niij are non-negative integers satisfying the set of equations 
l 

^ 7i«i« = "-j 
i=i 

and bookkeeping all the possible decompositions 
l 

(S ni (xi) ... S n '< (xk)) -»■ [] ( 5mi1 ( x - ^ ( x ^)) 71 • 
1=1 

In this study we are interested in the joint cumulant moments taken at two different locations xi and X2 up to the order 



(9) 



N = 


m - 


h 712 = 7. For these statistics, from now 


on simply called correlators and indicated as K nm , eq. [8] gives 


/tii 




Mil 




«12 




/il2 




ft 13 




Ml3 - 3/in/i 2 




^22 




M22 - 2 /in 2 - /i2 2 




Kl4 




/il4 — 6/il2/*2 - 4/in/i 3 




K23 




A«23 — 6 /il2/ill — /13M2 - 3 /X12/X2 




KlB 




Ml5 - 5/in/i 4 - 10/ii2/t3 - 10/il3/l2 + 


30/in/i 2 2 


K24 




[124 ~ 4/il2/!3 — 8/il3/ill — 6/!22/i2 — 6 


/il2 2 + 6/12 3 + 24/ill 2 /i2 — /*4/i2 


«33 




/i33 — 6 /il3/*2 ~ M3 2 — 9 /il2 2 + 12 /in 3 


- 9/i22/ill + 18/in/i2 2 


Kl6 




/il6 — 6 /in/i 5 - 20 /il3/*3 - 15 /*12/i4 - 


15/il4/i2 + 120/in/i2/t3 + 90/il2/l2 2 


K25 




/i25 + 40 /ill 2 /i3 — 10 /il4/ill — 5 /il2/i4 


— 20/il 3 /il2 + 30/il2/*2 2 — 10/i22/i3 — 10/1*23/42 - M5/*2 + 120/i3/i2 2 






+20/illjLt2/il2 




K34 




/i 34 — 4 /il 3 /i3 - 12 /il 3 /tl2 - 18 /i22/H2 


- 3/il4/*2 + 24/in/i 2 /i3 + 36/il2/i2 2 - 6/i23/*2 + 6/i3/*2 2 + 



72 /in /Ul2 — 12 j«23Mll — M3/M + 72 /ill/i2/il2 



where we ha ve used the fact that (<5) = 0. In the one-point limiting case one recovers the expressions of the cumulant moments 
kjv given by iFrvl 1)19841 ) (cfr. eq. 17). Note, also, that correlators are symmetric with respect to exchanging the indexes. 



2.2 Dynamics of the mass fluctuations 

Cosmological perturbation theory can be successfully applied to determine the evolution of the hierarchy of moments and 
correlators of the mass fluctuations. Theoretical predictions, however, cannot be derived without the prior assumption of a 
specific, primordial, clustering model. 

An interesting, simple and physically motivated case is that in which the initial density fluctuations in the matter 
component are described by a Gaussian probability density functional. In this case, odd moments are zero and all the 
even moments are completely specified by the moment of order two. However, even starting with an initial Gaussian over- 
density field, gravitational dynamics induces non-zero higher order correlations already in the mildly non-linear regime. In 
particular, by applying weakly non-linear perturbation theory (WNLPT) to compute the correlations that first emerges from 
a gravitationally unstable Gaussian field, one finds that, at leading order (lowest order in K2), 



km = SjvKo 



(10) 



where Sn are structure constants that do not depend on the scale of the fluctuations, on the initial power spectrum and are 
almost insensitive to cosmological parameters ( Peebles 1980l ; Fry 1984 ; Bernardeau 19921 ). 
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While extremely useful to characterize the gravitational properties of matter, the one-point reduced moments Sn do not 
retain information about the spatial configuration of clustering. Interestingly, their most immediate generalization, i.e. the 
correlators n nm show hierarchical scaling properties simila r to those in eg. 1101 Indeed, by assuming that the locations xi and 
X2 are sufficiently separated (large separation (LS) limit). iBernardeaul (|1996l ) showed that 



n + m- 
2 



(11) 



As it is the case for the Sn, even the C„ m statistics (hereafter calledthe reduced correlators of order (n, m) ) do show a 
negli gible dependence on the cosmological mass density parameter, the cosmological constant and cosmic epoch (jBernardeau 
1996). Indeed they are mostly sensitive to the initial conditions (assumed to be Gaussian) and to the physical mechanism 
that drives the distribution of the fluctuations away from the initial state (assumed to be gravity). 
In the LS approximation, the reduced correlators C nm factorize as 



C„lC rr 



(12) 



This key property shows that not all the reduced correlators are independent, and that some of them do not give access to 
complementary cosmological information. 



2.3 Finite resolution effects on the dynamics of the field 



In order to ease the comparison with an intrinsically discrete process such as the distribution of galaxies, as well as to facilitate 
the incorporation of the biasing scheme (e.g. eq.[T} into the analysis, it is useful to smooth the mass distribution on a spatial 
scale R. This is done by convolving the overdensity field 8 with a (normalized) window function of size R. 



5 fl (x) = / S(x')W 



R 



j3 / 

a x . 



(13) 



For a top-hat filter, <5r(x) is just the volume average of the density contrast over a sphere of radius R. Note that the 
smoothed correlators of order N = (n, m) retain some of the information contained in the Nth order correlation function. As 
a matter of fact, 



l^nm,R — 



yn+m. 



dyi...dy„ 



Vfl(xi) 



) (14) 

'Vr(x 2 ) 

where we have assumed a top-hat filter of volume Vr. From a physical point of view K n m,R is the average of the correlation 
function of order n + m over two distinct volumes separated by |xi — X2|. 

Smoothing and averaging are non commutative operations. As a consequence, while the relations given in eos. 191 fTTJl and 
[11] retain their validity when applied to filtered fields, the amplitudes of the smoothed dynamical variables (cumulants and 
reduced cumulants) become, instead, scale dependent. Consider for example the lowest-order non-zero cumulant moment and 
correlator, that is the variance of the mass fluctuations on a scale R 

tr% = k 2 ,r = (<5fl(x)) c , (15) 
and the covariance of the smoothed mass overdensity field 

Cfl (r) = ku,r = (Mx)Mx + r)) c . (16) 
Suppose, further, that mass fluctuations are small (\8\ <C 1) and described by the linear (dimensionless) power spectrum 



A'i = 4tv Ak ns+z T\k), 



(17) 



where A is the normalization factor, n s the primordial spectral index, and T 2 (k) the transfer function jBardeen et al.lll986l ; 
Efstathiou. Bond fc Whitl 19921 ; lEisenstein fc Hu1ll998l ). This implies that the amplitudes of second order statistics evolve as 



a function of time and scale as 

&k{z) = 4(o)d 2 (z)Tr, 

and 

Sn(r,z) = a 8 (0) 2 D 2 (z)g R (r). 



(18) 



(19) 



The n ormalization o f these equations is conventionally fixed at a scale r% = 8h 1 Mpc, D(t) represents the linear growing 
mode (|Peebleslll98(ih , while the effects of filtering are incorporated in the functions 



Tr 



f+°° A 2 L (k)W 2 (kR)dlnk 
J +o ° A 2 L {k)W 2 (kr s ) 2 d\nk 



(20) 
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and 

GrM = ^ fcr (21) 

& 2 L {k)W 2 {kr s )d\nk 

where VF is the Fourier transform of the window function. In Figure 1 we show the scaling of the smoothed two-point 
correlation function (as a function of both R and r) at two different cosmic epochs. There is an overall qualitative resemblance 
between the r dependence of the two-point correlation function £(r) and the R dependence of its smoothed version £n(r). 
More interestingly, the characteristic non-monotonic scaling induced by the baryon acoustic oscillations (BAO) that are frozen 
in the large scale matter distribution survives to the smoothing procedure and stands out also in the second order correlator 
as soon as r approaches ~ 100/i~ 1 Mpc. 

Computing the amplitude of the smoothed reduced cumulants and correlators at the next order (i.e. 5*3, r and Ci2,b) 
requires results from the weakly non-linear perturbation theory. If the primordial mass field is Gaussian and fluctuations with 
wavelength « R are suppressed u sing a top-hat filter, then the third order reduced moment, which is often referred to as 
the skewness of the density field, is ( Juszkiewicz. Bouchet. fc Colombl 1993 : Bernardeaul 1994b| ) 



S 3 ,r = y + 7 fl (22) 



while, in the LS limit, that is for separations r >> R, the reduced correlator of the same order is (jBernar dcau 199 ( 



C 12 ,«(r) = g + i 7 « + i/3 fl (r). (23) 
The effect of filtering is to introduce additional, scale-dependent, coefficients 
_ dloga 2 R 

JR - -t; 5- (24) 

a log R 

such that the reduced moment Ss,r effectively depends on the local slope of the linear power spectrum of density fluctuations 
(decreasing with the slope of the power spectrum), while the reduced correlator C\i,r acquires a specific and characteristic 
non-local dependence. In the following we parameterize the distance between the centers of independent smoothing spheres 
as r — nR, where n is a gener ic real parameter (usually taken, without loss of generality, to be an integer). According to the 



analysis of iBernardeaul (1 19961 ). the LS regime is fairly well recovered as soon as n 3. Interestingly, since WNLPT results 
hold for large R, such a small n defines a separation scale r that is already accessible using current redshift surveys such as 
the Sloan Digital Sky Survey. 



The Pr contribution in eq. I)25|) is usually neglected ( Bernardeau 1996) since, in the LS limit, £,r(t) is simply the two- 



point correlation function of the un-filtered field, a function that effectively vanishes for large separations. This is a critical 
simplification and the domain of its validity deserves more in-depth analysis. By assuming a power-law spectrum of effective 
index n e = —1.2 (i.e. "/r — — (n e + 3) = —1.8), we obtain that, on all scales 7?, the amplitude of Pr(uR) becomes negligible 
(^ 0.08) as soon as n ^ 3. Anyway this rapid convergence to zero is a peculiar characteristic of a scale-free power spectrum. If 
we consider a more realistic power spectrum (cfr. eq. I17p on scales that are accessible to both semi-linear theory and current 
large scale data (i.e. 10 < R < 30/i _1 Mpc, and n ~ 3), the amplitude of the /3r contribution is still significant and varies non 
monotonically as a function of the length scales r on which the cell correlation is estimated. This is illustrated in Figure [2] 
where we contrast the scaling of f3R(nR) and 7_r for different vales of R and n. The systematic error in the estimation of C12 
that is induced by neglecting the pR-term on relevant cosmological scales, is larger than the error with which this statistics 
can already be estimated from current data (see Figure[S]in section §6.2). For example, the amplitude of /3r, for characteristic 
values n = 3 and R — 10(/25)ft _1 Mpc, is ~ 15(/30) per cent that of 7^. Interestingly, one can see that, as for 7^, also the 
value of /3i?(r) does not depend on cosmic time, at least at linear order. This redshift-independence follows immediately from 
eqs. (O and (f25i 

Notice, finally, that WNLPT theory results are expected to hold in the correlation length range in which C12 can be 
unambiguously defined, that is up to the scale where the correlator kn crosses zero (see eq. Ill[) . This requirement sets an 
upper limit to the effective correlation scale n that can be investigated using predictions of WNLPT. In this large scale 
context, also notice that non-linear effects contributing to the baryon acoustic peak in the the two-point correlation function, 
would modify predictions obtained on the basis of the simple linear model of eq. (|17p . For these reasons we limit the present 
analysis to correlation scales nR ^ IOO/1 -1 Mpc. In a future work (Bel et al. in prep) we will compare WNLPT predictions 
against fully non-linear numerical results from matter simulations in order to constrain in a quantitative way the boundaries 
of the interval where WNLPT results safely apply. 
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Figure 1. Left: the 2-point correlation function of the smoothed mass density field fij is shown as a function of the smoothing scale R. 
We plot £fl for different values of the correlation length r = nR i.e. n = (black solid lines), n = 3 (red dashed lines), and n = 4 (blue 
dot-dashed lines) and a t two different redshifts: 2 = (thick lines) and 2 = 1 (thin lines). We adopt the linear power spectrum model of 
lEisenstein fc Hvj [|l998h and we assume the following set of cosmological parameters: Q m = 0.26, Qa = 0.74, Ho = 72 km s 1 Mpc , 
crg(0) = 0.79, f2;, = 0.044 and the spectral index n s = 0.96. Note that, for n = 0, §j;(0) = cr|j, while for a correlation length that goes 
to infinity the correlation function of the smoothed density field tends to zero. The characteristic bump induced by the baryon acoustic 
oscillations becomes clearly visible as soon as n is large enough. Right: the 2-point correlation function £ij(r) is shown as a function of 
the correlation length r for fields smoothed on different scales, that is R = 1, 10, 25 and 50h~ x Mpc. As the smoothing scale R tends 
to zero, one recovers the 2-point correlation function of matter particles, while as the filtering scale becomes larger, the BAO peak is 
progressively suppressed. Note also that for r — > 0, the value of £ii(r) saturates to the variance of the field on the given scale R. On the 
opposite sense (r — > oo) all the curves converge to the value of the two-point correlation function of matter particles. 



3 GALAXY CORRELATORS 

Smoothing is not the only process that leaves an imprint on the value of cumulant moments. Thei r amplitude and shape is 



also altered if galaxies, instead of massive particles, are used to trace the overall matter distribution. Fry fc Gaztafiaga ( 19931 ) 
computed the effects of biasing on one-point (smoothed) cumulant moments up to order 5. In this section, we generalize those 
results by deriving the expressions of the smoothed two-point cumulant moments of the galaxy overdensity field up to the 
same order. These are new observables that can be used to test predictions of the GIP in the weakly non-linear regime, to 
give insight into the gravity induced large-scale bias, and also to distinguish models with Gaussian initial conditions from 
their non-Gaussian alternatives. Some of these possibilities will be explored in sections §4 and §5. Since from now on we 
only consider smoothed statistics, we simplify our notation by omitting any explicit reference to the smoothing scale R. The 
R— dependence of relevant statistical quantities will be re-emphasized when necessary. 
Consider the iV-point, smoothed, correlation functions of matter up to order 5 

iij = Kii(xj,Xj) (26) 

(ijk = ftlll(xj,Xj,Xfc) 
Vijkl = Kllll(xi,Xj,Xfe,Xj) 
Uijklm = Klllll(Xi,Xj,Xfc,X;,X m ). 



At leading order, the corresponding statistics describing the distribution of galaxies, labeled with the suffix 'g' are 



£l2,9 
Cl23, S 



= ?<i 3 C123 + &i 3 C2 (£13623 + 2 perm) 



(27) 
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5 10 15 20 25 30 5 10 15 20 25 30 

R (h"'Mpc) R (h"'Mpc) 

Figure 2. Left : the logarithmic derivative of £n(r) with respect to R (cfr. eq. 1251 1 is shown as a function of the smoothing scale R for 
different values of the correlation length r = nR (i.e. for n = solid li ne, n = 3 short-d ashed line, n = 4 dot-dashed line and n = oo 
long-dashed line). We have assumed the linear power spectrum model of lEisenstein S z Hu (199^) and the same parameters value listed in 
the caption of Figure 1. The non-monotonic scaling of /3n(nR) at a scale R~ 25/i _1 Mpc is induced by the baryon acoustic oscillations. 
Note that, in the linear regime, /9fj(r) does not depend on the cosmic epoch, i.e. it is a redshift independent quantity. Right : the scaling 
of the reduced correlator of order 3 is shown as a function of the smoothing scale R for different values of the correlation length r = nR. 
Note that f or n = the corre lator reduces to the skewness S3 r and that for n = oo the expression of Ci 2 n{ n R) reduces to the one 
adopted by Bcrnardcau| lll996T ). 



»7l234, s = & i 4 ^1234 + &i 4 [c 2 (£23Ci24 + 11 perm) + c 3 (£i 4 £ 24 £ 34 + 3 perm) + c 2 2 (£i 3 £ 24 £ 34 + H perm)] 
W12345, 9 = &i 5 W12345 + &i 5 [c 2 (£is»?2345 + 19 perm + C125C345 + 14 perm) + c 3 (C135625645 + 29 perm) + 
C4 (C14C2464C45 + 4 perm) + c 2 2 (C135C24C25 + 119 perm) + c 2 c 3 (^14^25^34^24 + 59 perm) 
+C2 3 (^13^24^45^15 + 59 perm) ] . 

The mapping between correlators of mass (ft nm ) and galaxy (ftnm, 9 ) follows immediately by taking the two-point limiting 
case in the above expressions. Listed below are the results up to order n + m = 4 (including also non- leading terms): 

«11,9 = &1K11 + &i(c3 + Ci 2 c 2 )kh K 2 + l/26lC2«ll (28) 
Kia.s = 6 i(Ci2 + 2 c 2 )/tu k.2 + 6?c 2 «:? 1 + 6? (5/2 C3C12 + 2 c 2 Ci 2 + 3 c 3 c 2 + c 4 + 1/2 c 2 C* 22 + c 2 S z + C3S3 

+c 2 Ci3)kii k 2 + 6?(c 2 + 1/2 c 4 + 3c|Ci 2 + 2c 3 c 2 + c 3 Ci 2 )/t?i« 2 + 61C3C2K11 
Ki3,s = &i 4 (6c 2 2 + C13 + 3c 2 S , 3 + 3c 3 + 6c 2 Ci 2 )«;ii k 2 2 + &i 4 (3Ci 2 c 2 + 6c 2 2 )kh 2 k 2 + &i 4 c 3 /-cn 3 + &i 4 (3/2c 2 Ci 4 

+9/2 Ci 2 c 3 S 3 + 12 c 2 2 c 3 + 6 c 2 3 S a + 45/2 c 2 c 3 Ci 2 + 9 c 2 c 4 + 3/2 c 3 S 4 + 9/2 C4S3 + 3/2 c 5 + 15/2 c 3 2 + 3 c 2 2 S 4 
+3 c 2 2 C* 22 + 15/2 c 2 2 Ci 2 S 3 + 9/2 c 4 Ci 2 + 6 c 2 2 Ci 3 + 45/2 C3S3C2 + 5 c 3 Ci 3 + 6 c 2 3 Ci 2 + 1/2 c 2 C* 23 )kh k 2 3 
+fei 4 (9/2 c 2 2 Ci 2 2 + 6 c 2 2 Ci 3 + 3/2 c 3 C 22 + 39/2 c 2 c 3 Ci 2 + 3/2 c 4 Ci 2 + 9/2 c 3 S 3 c 2 + 3 c 2 4 + 18 c 2 3 Ci 2 
+3/2 c 3 Ci 2 2 + 15/2 c 2 c 4 + 18 c 2 2 c 3 + 6 c 2 3 S 3 ) Kh 2 k 2 2 + bi 4 (9/2 c 3 2 + 1/2 c 5 + 3/2 c 4 Ci 2 + 9 c 2 c 3 Ci 2 
+6 c 2 2 c 3 )kh 3 k 2 + 3/2 6i 4 c 2 c 4 Kn 4 

where a = bi/bi for i ^ 2. In appendix C, we list the expressions for the galaxy correlators up to order 5. For xi = x 2 , one 



The amplitude of large-scale matter fluctuations 9 



recovers the expressions of Fry fe Gaztafiaga 1 19931 ) (cfr. their eq. 9)0 The leading term in eq. (|28[) reduces, in the one-point 
limiting case, to 



«2, 9 



1.2 

b 1 K 2 . 



(29) 



We find that, to leading order in the product k\\k 2 the remaining results, K„ m) , for n + m ^ 3, preserve the hierarchical 



properties of matter correlators, i.e. K nrl 



m,g K ll,g r *' 2 ,g 



K 



n+ro— 2 



with amplitudes C nm ,g given by 



Cl2, ff = &l~ 1 (Cl2+2 C2 ) 

Ci 3>g = &r 2 (Ci3+3c 2 (S'3 + 2C , i2) + 3c3 + 6c;!) 

C 2 2, 9 = br 2 (C22 +4c 2 Cl2 +4C 2 2 ) 

Ci4, 3 = br 3 (C14 + 4 c 2 (S4, + 3 C13) + 12 (c 3 + 3 c|) (S 3 + C 12 ) + 12 c 2 C 12 S 3 + 4 c 4 + 36 c 3 c 2 + 24 c 3 2 ) 

C 23 ,g = &l"" 3 (C23 + 2 C 2 (C13 + 3 C22) + 3 ( C3 + C 2 S 3 )C 12 + 6 c| (S3 + 3 C12 ) + 6 c 3 c 2 + 12 c|) . 



(30) 



These relations show that in order to draw any conclusions from the galaxy distribution about matter correlations of order 
N, properties of biasing must be specified completely to order TV — 1. Note, also, that the equations (|30[) have been obtained 
in the large separation ap proximation and f a il as s oon as |xi — X2I < R. As a consequence, in the one-point limit they do not 
converge to the results of lFrv fc Gaztafiaga ( 19931 ) on the amplitude of the reduce cumulants, that is 



S3,g = 6^(^3+3 02) 

S 4 , 9 = &j- 2 (S4 + 12c 2 S'3+4C3 + 12C2 2 ) 

S 5 , g = 67 3 (S 5 +20c2S4 + 15c2S3 2 + (30c3 + 120c2 2 )5 3 + 5c4 + 60c2C3 + 60c2 3 ) 



(31) 



Interestingly, the hierarchical scaling (cfr eq. |TT]) ) is not the only matter property which survives to the local non-linear 
biasing transformation of eq. Q] We also find that 



C 22 ,g — Cl 2 ,gC\ 2 ,g 
C 2 3,g — Cl2, S Cl3,g 



(32) 



that is the reduced galaxy correlators G nm ,g, conserve the factorization property of the matter density field (cfr. eq. lfl2"jl.) 



4 NON LINEAR BIAS IN REAL SPACE 



The most common methods for estimating the amplitude of the non-linear bias coefficients bj rely upon fitting a theoretica l 
model to higher ord er statistical obser vables, such as the 3-point correlation function (e.g. Gaztafiaga fc Scoccimarrol 2005 '). 
the bispectrum (e.g. Verde et a! 2002T h or the galaxy probability distribution function (e.g. Marinoni et al .1 120051 ). 

In this section we take a different approach. We explicitly derive analytical relations directly expressing the non-linear 
bias coefficients bi as a function of high order observables. To this end we cannot simply invert the set of equations [30] (or [31]), 
since the JVth reduced galaxy correlator (or moment) are a function of N + 1 bias coefficients. We add, instead, the expression 
of the 3rd order reduced correlator Ci2, 9 to the system of equations (|31[) and solve the resulting set for the biasing coefficients. 
We obtain 



hi 

bz 
b. 



3 C12 — 2 S3 
3 Ci2,g — 2 5*3,3 

(Ci253, 9 — 53Ci2,g)(3Ci 2 — 2S3) 
(3Ci2, 9 - 2 5 3 , 9 ) 2 

(9 54, 9 C 2 2 — 12 S4,gCi 2 Ss + 4 S4, 9 S 2 — 12 S3S3, 9 Ci 2 Ci2, 9 + 24 535 2 , s Ci2 — 24 5 2 53, 9 Ci 2 , 9 + 24 S^Cx 2 ^ g 



(33) 

(34) 
(35) 



1 Actually wc found two typos in their eq. (9). The term 210c|S3 is actually 210c|53 and 180c|54 should read I8OC2S4. Anyway such 
terms were subsequently neglected in that analysis, leaving the author's conclusions unaltered. 
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-9S 4 C7f 2)9 + 12 S 4 C 12 , a S 3 , g - 4 5 , 4 S 3 2 9 - 12 S 3 2 9 C 2 2 )(3 C 12 - 2 5 3 )/4(3 C*i 2 , g - 2 5' 3 , 9 ) 3 
6 4 = (-60 S 3 3 S 4 , 9 Ci2, 9 - 108 5 5 , 9 Ci2 2 S 3 - 120 S4S3JS3 - 600 S 3 ,g 3 S 3 Cl 2 + 495 S±S 3 Cl 2)g - 600 S 3 , g 2 SlC 12>g (36) 
-270 Ss^S^gClz + 72 Ss, g C 12 Sl + 108 S 5 G^ g S 3 , a + 1200 S 3 , fl S|C? 2 , 9 + 600 5|, 9 5 3 Ci 2 - 40 S 4 S 3 , 9 Ci2 
+120 S , 3 , 9 S , 4, 9 S 3 — 72 SsCi2, s S 3y g — 90 SiS 3} gC\2C\2,g + 120 S , 4S ,2 , 9 Ci2Ci2, 9 — 930 SAS 3 , g S 3 Ci2,g 
+580 S4S' 3i9 S' 3 Ci2, 9 + 630 S , 3 , 9 S , 4, 9 Ci2>S' 3 — 480 S , 3, 9 S4, 9 Ci2S' 3 2 + 180 S 3>g S 3 C\ 2 Ci 2 , g — 600 S 3i9 5'|Ci2Ci2, 9 

+270 S3,gS 3 Cl2Cl2,g ~ 135 S3 S4,,gCl 2 Cl2,g + 180 S 3 Sa, 9 Ci2 Cl2, 9 + 54 S5,gCf 2 ~ 16 Ss,gS 3 — 54 SsCl 2 ,g 

+16S 5 S 3 3 9 + 240S 3 3 9 C 1 3 2 - 690S 3 3 C 1 3 2, 9 )(3Ci2 -2S 3 )/lO(3Ci2, 9 -25' 3 , 9 ) 4 . 



This set of equations allow us to investigate the eventual non-linear character of the biasing function up to order 4 by 
exploiting information encoded in the reduced correlators up to order 3 and the reduced moments up to order ^ 5. In a 
forthcoming analysis we will investigate up to what precision the coefficients 6j can be estimated using data of large redshift 
surveys such as BOSS, BigBOSS and EUCLID. Note that, if we set Pr(t) = in eq. (1231) . then o ur expressions for 6i and 62 
(cfr. should eqs. (|33[) and (|34[l ) reduce to equations (4) and (5) originally derived by ISzapudil|l998h . As stressed by this author, 
in this formalism biasing coefficients are not anymore simple parameters to be esti mated (by maximizing, for example, the 
likeli hood of observable s that are sensitive to them s uch as the reduced skewness S3 jGaztanaga|[l994l ; iGaztanaga fc Friemanl 

2002), 



19941). the bispectrum ijFrvj 



1994! 

( Gaztanag a fc Scocc imarro 

iMarinoni et alJ (|2005l . I2008T ). but they become themselves estimators. 

In what follows we will focus our analysis on the linear biasing parameter bi only. In our formalism, this quantity is 
explicitly expressed in terms of the amplitude of third-order statistics (i.e. the reduced cumulants and correlators) and it 
is independent from the amplitude of second-order statistics such as or and ^r. Notwithstanding, b± seems to depend on 
the shape of the power spectrum of the matter density fluctuations via the terms 7_r and Pn{r) that appears in eq. (|23l) . 
We will demonstrate in the next session that the inclusion of the correction term Pn(r) in our analysis has the additional 
advantage of making the linear bias coefficient 61 effectively independent from any assumption about second order statistics, 
i.e. independent not only from the amplitude but also from the shape of the linear matter power spectrum. This fully third- 
order dependence of the linear biasing parameter estimator b\ allows us to construct a consistent second-order estimator of 
the matter density fluctuations or on a given linear scale R. 



19941; Scoccimarrc 



1998; 



Feldman et al. 2001; Verde et al 



20051 ; |Pan fc Szapudilbooa )" or the full probability distribution function of the density fluctuations 



5 THE RMS FLUCTUATIONS OF THE LINEAR MASS DENSITY FIELD 

Now that all the ingredients are collected, we detail how we construct our estimators of the linear matter density fluctuations 
or. From eq. ([22]), p3]>. (f29|) and ([33j) we obtain 



T 9,n( r ) ,o 7 \ 
a R = o 1 \ a a. R 

where 

T g , R (r) = 3C*i2, 9 , fl (r) - 2S 3 , g ,R (38) 

and where the suffix g indicates that the relevant quantities are evaluated using data. For clarity, the scale dependence of 
third order statistics is explicitly highlighted. Apparently, the right-hand side of the above equations depends on the overall 
shape of the a-priori unknown matter power spectrum. In reality, the terms /3fl(r) — 7_r can be consistently estimated from 
observations without any additional theoretical assumption. To show this, we define 

< 39 > 

where 

Mr) = (40) 
"r 

As far as matter particles are considered, the previous definitions imply that a R (r) = P R (r) —^r. By combining the expression 
for ^ 9 ,h given by eq. [551 with its one-point limiting case i.e. 

oIr(z) = bloKz) {1 + (l/2c 2 2 + S 3 ,RC2 + c 3 ) o%(z)} , (41) 
and using the fact that, on scales where WNLPT applies, o\ <C 1, we obtain 

Vg.nir) ~ VR (r) - {(S 3 ,r - Ci2,fl)c 2 + l/2c 2 2 } fflfr) + l/2c 2 V(r)£ fl (r), (42) 
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where rj gtR (r) = £g,R(r)/(Tg tR and where the terms on the RHS have been sorted by order of magnitude. Finally, in the LS 
limit [^r(t) is negligible with respect to r/i?(r)], the above equation reduces to ?7 9 ,i?(r) ~ rjR{r). The level of accuracy of this 
approximation is presented in Figure [3] where we show that, on a typical scale (R ~ 16/i _1 Mpc), the imprecision is less than 
0.5% at any cosmic epoch investigated (0 < z < 0.6). Since a 9 ,ii(r) = Qfl(r), we obtain 

where a 9 ,jj(r) = dlogi] giR (r) /dlog R. 

Counts-in-cells techniques provide and estimate of the terms on the RHS of eq. 1431 In this regard, a central point worth 
stressing concerns the continuum-discrete connection. Biasing is not the only obstacle hampering the retrieval of matter 
properties from the analysis of galaxy catalogs. The formalism needs also to correct for the fact that the galaxy distribution 
is an intrinsically discrete process. These issues will be thoroughly addressed in Section §6.1 where we present the strategy 
that we have adopted in order to minimize the sampling noise. 

Redshift space distortions are an additional effect that needs modeling. Results presented in the previous sections strictly 
hold in real (configuration) space. Therefore, the feasibility of extracting the value of mass fluctuations or via eq. (|43p rests 
upon the possibility of expressing real space variables (b^ 1 = t 9 ,_r/q 9i _r, and (t 9i _r) i n terms of re dshift space observables. 



On the large (linear) cosmic scales where our formalism applies, the Kaiser model (e.g. lKaiserlll987f ) effectively describes the 



mapping between real and redshift space expressions of second order statistics. The transformation is given by 

1/2 



'g,R 



+ 11 + 1(1)' 

3 6i 5 \bj 



<7g,R, (44) 



where the suffix z labels measurements in redshift (as opposed to configuration) space, and where / is the logarithmic derivative 
of the linear growth factor D(a) with respect to the scale factor a. Much easier is the transformation rule for a g ,R\ in linear 
regime it is unaffected by redshift distortions, that is a g R = a 9tR . 

Assessing the impact of peculiar motions on third order statistics is less straightforward. Up to now all the formulas 
were derived analytically from theory. To address this last issue we now use numerical simulations. By running some tests 
using a suite of simulated galaxy catalogs (described in section §6.2) we conclude that the amplitude of 53, 9 and Ci2, 9 
are systematically (and non-negligibly) higher in z-space and that the relative overestimation systematically increases as a 
function of the order of the statistics considered (see Figures [5] and [6] in section §6.2). Notwithstanding, from the theoretical 
side, the expressions of third order statistics (S3.R = S3 + Jr and C12 = (25a)/3 + Jr/3 + /3 R (r)/3) imply that the linear 
combination ?>C\2,r — 2Ss y R should be much more insensitive to redshift distortions. Note, in particular, that both 7^ and 
Pr(t) are unaffected by linear motions. However convincing it might seem, this guess applies only to matter particles. In order 
to draw definitive conclusions about the impact of peculiar motions on t 3i r, we have used N-body galaxy simulations. Guided 
by synthetic catalogs we demonstrate that the biased galaxy statistics t 9 ,r is effectively unaffected by redshift distortions. 
This conclusion is graphically presented in Figure [3] where we show that the relative error introduced by reconstructing the 
statistics using observed redshifts, instead of the cosmological ones, is progressively smaller as R increases, and it is globally 
<~ 2%. 

By incorporating these results into the formalism we finally obtain 



or = o g ,R 



r X g,R J 3 r| iH 5 



-1/2 

(45) 



an estimator that is manifestly independent from any assumption about the amplitude and shape of the linear matter power 
spectrum. Also, this formula is independent from any assumption about the value of the Hubble constant Ho. Only an a-priori 
gravitational model must be assumed to correct for redshift space distortions in the local universe, that is to evaluate the 
growth rate function /(z). This introduces an additional strong dependence on the cosmological parameters Q.m and Qa on 
top of the marginal one that is forced upon when we compute metric distances in order to estimate o gR , a g R (r) and r' g R . 
More quantitatively, when the input cosmological parameters are chosen in the parameter plane delimited by ^ Qm 1 and 
^ Qa ^ 1 , the maximum relative variation of the estimates with respect to their fiducial value in the ACDM cosmological 
model are max|d///| ~ 0.6, maxjcir/r| ~ 0.2, max|da/a| = 0.3, and max\da/a\ = 0.15. The weak cosmological dependence 
of r g R follows from the fact that neither the reduced cumulants nor the reduced correlators of mass are effectively sensitive 
to the background cosmology. We expect them to be essentially unaffected also by sensible modifications of the gravitational 
theory (Gaztanaga & Lobo (2001), Multamaki et al. (2003), but see for example Freese & Lewis (2002) or Lue, Scoccimarro 
& Starkman (2001) for more radical scenarios where this expectation is not met). We have also noted that the denser in 
matter is the cosmological model the more overestimated are the values of both of oz g R and r g R . Since also the amplitude 
of the cosmological dependence is nearly the same, we expect the ratio ot z g R jT g R , i.e. the linear bias parameter, to be nearly 
insensitive to the underlying cosmological model. We have quantitatively verified this statement in Figure 11 of section §6.5. 
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r=3R 



r=3R, <z> = 0.34 



R=1 6 h"'Mpc 




«5„> = 0.001 ±0.008 



0.2 



0.3 



0.4 
redshift z 



0.5 




-0.4 



20 

R (h"'Mpc) 

Figure 3. Left: the relative error arising from estimating the ??-function using galaxies instead of matter. The inaccuracy 5 V = if r/Vr ~ 1 
is evaluated at different epochs and is determined by assuming R, = 16/i -1 Mpc and r = 3R. Simulated data are extracted from the mock 
catalogs described in section §6.2. Right: relative difference between the real- and redshift-space estimation of T g r = 3Ci2, 9 (r) — 253 l9 . 
The inaccuracy 8 T = r(t)/t r{t) — 1 is plotted as a function of the smoothing scale R and is computed at a separation r = 3R. The 
dotted line represents the case in which computing the r statistics using observed redshifts is equivalent to using cosmological redshifts. 
Real- and redshift-space simulated data are extracted from the mock catalogs described in section §6.2. In both panels the error bars 
represent la standard deviation. 



In conclusion, the estimator in eq. (|45l) is sensitive to cosmology mainly through the growth rate function f(z) and, to some 
de gree, through o~ g R. 

Once the cosmologic al background is known via independent techniques (e.g Astier et al, ( 20061 ) ; Komatsu et al. ( 2011 ); 



Marinoni fc Buzzil (|2010j))) the strategy we have outlined offers the possibility to estimate in a direct way the amplitude and 



time evolution of matter fluctuations. The formalism could also be implemented, in a reverse direction, to probe the coherence 
of the gravitational instability paradigm. Any eventual discrepancy resulting from the comparison of the measurements (cfr. 
eq. (05])) with theoretical predictions (cfr. eq. (|18[l) provides evidence that either the assumed set of cosmological parameters 
are wrong, either the assumed power spectrum of matter fluctuations is poorly described by linear theory, either the time 
dependence of the linear growing mode D(t) is deduced in the context of an improper gravitational model. This last testing 
modality will be explored in a further paper. 



6 APPLYING THE METHOD 

The practical implementation of the method, including successful tests of its robustness, is discussed in this section. We first 
present a strategy to estimates the correlators of order (n, m) of discrete 3D density fields such as those sampled by galaxy 
redshift surveys. We then show that, by applying the test formalism to N-body simulations of the large scale structure of 
the universe, we are able to recover the amplitude and scaling of the linear matter fluctuations or(z). A strategy to test the 
coherence of the results and to validate our conclusions is also designed, applied and discussed. 



6.1 Statistical estimators of the galaxy correlators 

The galaxy distribution is a discrete, 3-dimensional stochastic process. The random variable N models the number of galaxies 
within typical cells (of constant comoving size) that ideally tesselate the universe or, less emphatically, a given redshift survey. 
Notwithstanding, a variable that is more directly linked to theoretical predictions of cosmological perturbation theory is the 
adimensional galaxy excess 
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where N is the mean number of galaxies contained in the cells. 

To estimate one-point moments of the galaxy overdensity field (/J.n, g = (Sn)), we fill the survey volume with the maximum 
number (Nt) of non-overlapping spheres of radius R (whose center is called seed) and we compute 

1 Nt 

An, 9 = Jj-z2 S ff,u (47) 

i=i 

where 8n,z is the adimensional counts excess in the i-th sphere. 

As far as two-point statistics are concerned, we add a motif around each previously positioned spheres. The center of 
each new sphere is separated from the seed by the length r — nR and the pattern is designed in such a way to maximize 
the number of quasi non-overlapping spheres at the given distance r (the maximum allowed overlapping between contiguous 
spheres is 2% in volume.) The moments H nm ,g = {^jv»^Wi)) that is the average of the excess counts over all the i and j 
spherical cells at separation r, are estimated as 



(48) 



where N mo t is the number of spheres at distance r from a given seed, and where we have assumed that the stochastic process 
is stationary. The expression of the one- and two-point cumulant moments k„ m , g follows immediately from eq. ([9]). 

Since galaxies counts are a discrete sampling of the underlying continuous stochastic field A 9 (x) (see section §2.1) it is 
necessary to correct our estimators for discreteness effects. In other terms, the quantity of effective physical interest that we 
want to estimate is 5 g> ii(x) = A 9 (x)/A 9 — 1 where A 9 (x) = f v i x \ X(x.')d 3 x' is the continuous limit of the discrete counts N 
in the volume V. To this purpose, following standard practice in the field, we model the sampling as a local Poisson process 
fLPP. lLavseJ l|l95d) ) and we map moments of the discrete variable N into moments of its continuous limit by using 




(A^) = (N(N-l)...(N-n+l)) = ((N)]), 



in the case of one-point statistics, and its generalization (ISzapudi fc Szalay| |1997) 
(A^( Xl )A™(x 2 )) = ((Ni) 1 }(N 2 )f l ), 



(49) 



(50) 



for the two-point case. As a re s ult, the estimators of fc nm , 9 
( Szapudi. Szalav fc Boscanlll992l ; Ungulo et al.ll2008h 



{8g iR (x)5™ R (x + r)) c corrected for shot noise effects are 



fc 2, 9 




£2,9 - 


kll,g 




£11,9 


&3,g 




£3,9 - 


fcl2, 9 




£12,9 - 


fc4,g 




A4,9 - 


fcl3,g 




£l3,9 - 


fo>2,9 




£22,9 - 


k 5,g 




£5,9 — 


fel4,g 




£l4,9 - 


"-23,9 




£23,9 - 








where 


we 


have set 



(51) 



A _1 £ii,9 

?AL - 6JV -1 /t8,g + HA r_2 A2,9 - 6A" 3 
3/t2, 9 £n,9 + 2A~ 2 /iii,9 - 3A" -1 /ii2, 9 

2 £ll,9 ~ A2,9 + A _2 /in,9 - 2A _1 /il2,9 
10/i 2 , 9 /t3,9 - 10A^ _1 {/i4,g — 3/^2,9} + 35A" _2 /i3,9 ~ 50A"~ 3 /i2,9 + 

C,"... A. A?. . C ]\T _1 f .-, . OA t ' T 1 A7 — 2 



>• ■ 24N~ 4 



— 6N {Al3,g — 3/tn >s /i2,gj i f*LJ,g ™. cn, s 

- 3A _1 {A22, 9 - 2Aii, 9 - A*2, 9 } - A _1 {Ai3, 9 - 3/i 



- 3/i2,9£l2, 9 - £ 2 , 9 £3, 9 



, 9 } + llA- 2 Ai2, 9 -6JV _3 An, 9 

l {£l3,g — 3£ll,g£2,g} + 5A"~ 2 /}l2, : 



+ 



— £i0,g — £«,9 and ^0i, 9 — kiO,g — ki,g- 



6.2 High order correlators extracted from cosmological simulations 



We use numerical experiments simulating the spatial clustering of Luminous Red Galaxies (LRGs) of the Sloan Digital Sky 
Survey (SDSS) to validate the method, to test its end-to-end coherence a nd to spot the presence of eventual systematics. We 
have run the whole pi peline on two differen t simulations, namely Horizon ( Kim et al. 20091 ; Dubinski et al. 2004 ; Kim fc Park 
2006) and Las Damas (jMcBride et a,l.|[2009l ') . Ideally, this way, we are the least dependent upon the specific simulation strategy 
and technique. Since the outcome and salient features of the analysis are essentially the same, we here only present the analysis 
of the Horizon simulations. This is a large simulation (4120 3 particles in the box) in which LRGs galaxies are selected by 
finding the most massive gravitationally bound, cold dark matter halos. It is characterized by the following set of cosmological 
assumptions (O m = 0.26, S1 A = 0.74, w = -l,w a = 0,H = 72km/s/Mpc,ri6 = 0.044, n s = 0.96, erg = 0.79). We have 
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Figure 4. Left: the correlator of order (1, 1) of the smoothed galaxy overdensity field at the average redshift z = 0.34 is shown as a 
function of the smoothing radius R, and for different values of the correlation length (n = {0, 3,4}). In the degenerate case n = we 
recover the variance of the galaxy fluctuation field. Right: the redshift evolution of ^g t n(nR) at a given smoothing scale (R = 17/i~ x Mpc) 
is shown for different values of the correlation length (n = {0, 3, 4}). The statistics are computed in both real (solid symbols) and redshift 
space (unfilled symbols). Each point is the average of the results obtained from 8 independent full-sky LRGs catalogs. Errorbars are 
estimated as the standard error of the mean. 
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Figure 5. Left: the reduced correlator of order (1, 2) of the smoothed galaxy overdensity field at the average redshift z = 0.34 is shown 
as a function of the smoothing radius R, and for different values of the correlation length (n = (upper panel) n = 3 (central panel) 
and n = 4 (lower panel)). In the degenerate case n = we recover the reduced skewness of the galaxy fluctuation field. Right: the 
redshift evolution of C\2,g,n{ n R) a t a given smoothing scale (R = 17h~ Mpc) is shown for different values of the correlation length 
(n = {0,3,4}). The statistics are computed in both real (solid symbols) and redshift space (unfilled symbols). Each point is the average 
of the results obtained from 8 independent full-sky LRGs catalogs. Errorbars are estimated as the standard error of the mean. 
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Figure 6. Tthe reduced galaxy correlators of order 4 (Ci3 i9 .h and C22,g,n) and of order 5 (C23, 9 ,ij) at the average redshift z = 0.34 and 
at the correlation separation r = 3R are shown as a function of the smoothing radius R. The statistics are computed in both real (solid 
symbols) and redshift space (unfilled symbols). Each point is the average of the results obtained from 8 independent LRGs catalogs. 
Errorbars are estimated as the standard error of the mean. 



analyzed 8 nearly independent, full-sky light cones extending over the interval 0.15 < z < 0.55, each covering a volume of 
13 ft _3 Gpc 3 and containing nearly 3.8 • 10 6 LRGs galaxies. A mass threshold decreasing with redshift was chosen so to force 
Horizon's comoving number density profile to be constant with redshift and reproduce the density profile observed in the SDSS 
LRGs sample. As a consequence, the simulated sample can be considered with good approximation as being volume limited. 
The mean comoving number density is n = 3.0 x 10~ 4 ft 3 Mpc -3 and the mean inter-galaxy separation is i ~ 19 ft 1 Mpc. 
As an example, the average number of LRGs galaxies (N) inside a spherical cell of radius R = (10, 15, 20, 25)h~ Mpc is 
approximately 1,5, 10 and 20. 

In the left panel of Figure[?]we show the correlation function of the smoothed galaxy overdensity field f Si ij = (<5 g> fl(x)<5 gi fl(x-|- 
r)) c for different values of the correlation length (r = nR) in both real and redshift spaces. Note that in the degenerate case 
n — we recover the variance of the galaxy density fluctuations on a scale R. If the cell separation increases, the amplitude of 
the galaxy correlators of order 2 decreases. Their i?-scaling is nearly similar to the slope of the analogous statistics computed 
for the matter density field (see Figure [2]) with a slope of ~ —2.25 at R = 10ft _1 Mpc and ~ —3.4 at R — 20ft _1 Mpc for 
the correlation configuration n = 3). Note also the neat appearance of the characteristic baryon acoustic peak at the scale 
R ~ 25ft -1 Mpc when the correlation length is computed for n = 4. We interpret these results as a qualitative indication of 
the fact that, at least on the scales explored by our analysis, the linear biasing parameter is well approximated in terms of a 
scale-free parameter. 

On the right panel of Figure [4] we show the redshift dependence of the (, g ,R for a given arbitrary smoothing scale (in 
this case R = 17ft _1 Mpc). The constant amplitude of this statistic, together with the fact that the corresponding matter 
statistics decreases by no more than ~ 0.05 in amplitude over the same redshift interval (0.15 < z < 0.55, see Figure EJ, 
provide evidences that linear biasing was nearly ~ 15% stronger at the early epoch z = 0.55. 

In Figure [5] we show the scaling of the one- and two-point reduced galaxy cumulant moments of order 3, namely S , 3, g ,ii 
and C\2, g ,R (r). The slight and systematic decrease of 53, 9 ,ij as a function of scales, much less pronounced than that of 53, r 
(see Figure[2|, is not compatible with biasing being described by a single constant parameter b\. Since, we have already argued 
that b\ is scale independent, this implies that the biasing function is non linear and the next order biasing coefficient 62 must 
show some scale dependency. 

Additional information about the clustering of large scale structures can be retrieved from the analysis of the comple- 
mentary third-order galaxy statistic, i.e. the reduced correlator Ci2, 9 ,fl(r = nR) on scales R where this indicator is not too 
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noisy. For both the correlation configurations n = 3 and n — 4 this requirement limits the region of interest to the scales 
R ^ 25h~ Mpc. Despite the fact that measurements on different scales are correlated, it appears that both these statstics are 
fairly independent from R. We remark that in the LS limit the value of Ci2, 9 ,H is also independent from the correlation scale 
r. As a consequence, eq. (|43[) is best evaluated by adopting the smallest possible value of n, i.e. the one that minimizes the 
amplitude of the errorbars. 

Our analysis shows that both the amplitude of S3, 9i r and Ci2,g,n(r) are nearly redshift independent, i.e. mostly inde- 
pendent from the cosmic epoch at which these statistics are computed. This property holds in both real and redshift spaces 
and mirrors the analogous behavior, predicted by theory, for the reduced cumulant moment of the matter field (see eqs. (|22|l 
and (|23[) .~) Because of this we can conclude that, at least up to redshift z ~ 0.55, even the next order biasing coefficient, i.e. 
62, is weakly sensitive to time in the Horizon simulations. 

The WNLPT predicts that reduced moments and correlators of the matter field should display hierarchical properties 
in real space. Figure [5] shows that the scaling predicted by eq. (|10[) still holds in redshift space, a results originally found by 
Lahav et al. 1993 and Hivon et al. 1995 who showed that it holds even on smaller scales than those analyzed here, i.e. on 
domains where non-linear effects become important. In an analogous way, Figure ((5} shows that the mapping between real and 
redshift space also preserves the hierarchical properties of the reduced correlator Ci2, 9 . This property is not a characteristic 
of low orders statistics only. In Figure [6] we present the estimates of the galaxy reduced correlators up to order 5 in both real 
and redshift space. Observations in the local universe have shown that the reduced cumulants Sn, 3 ,r of the smoothed galaxy 
field in redshift space display hierarchical clustering properties up to order N = 6 (e.g. Baugh et al. 2004). This plot shows 
that also the reduced correlators C nm ,g,R extracted form the Horizon simulation preserve the hierarchical scaling up to order 
5 in redshift space. Note that, as already anticipated in section §5, the amplitude of the clustering is larger in redshift space 
and that the relative difference with respect to real space estimates increases systematically as a function of the order of the 
reduced correlators. 

Finally, Figure [6] graphically displays the validity of the factorization property that we have found in eq. (|33[1 , i.e. that 
not all galaxy reduced correlators contain original information. We stress that even if the factorization property C*22, 9 = C12 g 
was shown to hold analytically only in real space, simulations now show that it holds also in redshift distorted space. 

We conclude by commenting on the precision of the estimates. The relative error in the estimation of reduced correlators 
increases, as expected, with the order of the statistics. Moreover, two-point statistics are recovered with larger uncertainty than 
the one-point statistics of the same order since they are estimated using a smaller number of independent cells. Specifically we 
find that, the larger the correlation scale, the stronger the sensitivity of the estimates to finite volume effects. On the typical 
scale R = 20ft _1 Mpc the relative error with which Ci2, 9 , C?i3,g, Ci4, 9 > C22, 9 and C23, 9 are recovered is 4, 5, 10, 15 and 33% 
respectively. This can be compared to the precision with which the equal-order reduced moments Sn have been estimated on 
the same scale, that is 0.7, 2 and 10%, for the order 3, 4 and 5 respectively. 



6.3 Estimation of or(z) 

In this section we test the efficiency of the estimator given in eq. (|45[) . Three major potential issues, if not properly addressed, 
may affect its reliability. First, it is imperative to test whether we can safely apply WNLPT results in the LS approximation 
to compute the re duced correlators C nm ,R(r) in the limit in which the cell separation is as low as r ~ 3R, as the analysis of 
Bernardeau jl996t ) suggests. Second, we want to verify that peculiar velocity corrections as well as redshift-space observations 



do not introduce unexpected biases into our real-space observables. Finally, we want to test if the local Poisson model fairly 
corrects for the sampling noise in the low counts regime. 

To fulfill these goals, we apply the estimator given in eq. (|45p to the simulated LRGs catalogs and gauge the precision 
with which we can retrieve the real-space amplitude and scaling of or, that is both the local normalization and evolution of 
the linear matter perturbations embedded in the ACDM simulations. 

The following argument help us to select the range of scales R that are best suited for applying the formalism to the 
simulated catalogs. We expect that the estimator given in eq. (|45[) will work neatly on sufficiently large scales R (where the 
WNLPT and the linear modeling of redshift distortions both apply) and on sufficiently large correlation lengths r = nR (where 
the LS approximation applies). On the smallest scale where the method can be theoretically applied, i.e. R — 10/i _1 Mpc, the 
amplitude of or, which is of order ~ 0.4 at the average redshift of the sample, is completely dominated by shot noise which 
is of the order ~ 1. The signal becomes dominant with respect to discrete sampling corrections as soon as R is greater than 
~ 15ft _1 Mpc. Moreover, below this last scale, a small, but statistically significant imprecision arises in assuming t 3) r = Tg R , 
as shown in Figure [3] On the opposite end, the largest scale R accessible is set by the geometry of the survey and the 
requirement of sampling the correlation length r — nR with sufficient statistical power. We find that the relative error in 
our estimate of Ci2, 9 ,h(?") becomes larger than ~ 10% (see Figure [5} for scales R > 22ft~ 1 Mpc when n = 3 and for scales 
R > 18/i _1 Mpc when n = 4. 

In Figure[7|we plot the estimates of <jr based on LRGs mock catalogs. We recover the rms of the linear matter fluctuation 
field on two smoothing scales (R — 17 and 20/i _1 Mpc) and for three different correlation lengths (n — 2, n — 3 and n = 4). 
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By contrasting our measurements against the theoretical predictions obtained by inserting into eq. (|18p the parameters used 
in the Horizon sim ulations, we find th at our reconstruction scheme fails when the correlation length is as low as n = 2. This 
was expected since Bernardeau (1996') already showed that the LS approximation does not hold on such small correlation 
scales. Effectively, when we probe larger scales (n = 3), the reconstruction becomes significantly more accurate (central panel 
of Figure [7J|, with the estimates of or at the average redshift of the catalog (z — 0.34) being affected by a relative error of 
of 13% and 15% on the scales R — 17 and 20ft _1 Mpc respectively. For n = 4 errorbars become too large for the estimates to 
be also precise. Additionally, we remark that our estimates seem to slightly overestimate the value of or on both the scale 



analy zed. This is also confirmed by the analysis of an independent mock catalog, i.e. the Las Damas simulation (IMcBride et al 



2009). As stressed in section §2.3, this is due to the increasing inaccuracy in the theoretical prediction of the amplitude of the 
reduced correlators Ci2,R on correlation lengths that approaches the scale where £r crosses zero. 

We re-emphasize that these estimates are totally independent from any assumption about shape and normalization of the 
power spectrum of linear matter fluctuations. They are also independent from the amplitude of the present day normalization 
of the Hubble parameter Ho- Our estimates, however, do depend on the set of cosmological parameters (Qm, £^a) that we have 
used to assign galaxies to cells (i.e. to smooth the galaxy distribution), and to subtract the effect of redshift space distortions 
(i.e. to evaluate the growth rate function /). 



6.4 Estimation of the local value of erg 

As we have already discussed, the scale R — 8/i _1 Mpc falls outside the range of applicability of the test. Nonetheless, we can 
extract information about the value of erg at redshift z ~ (crs(0)) from measurements of or on larger scales. We do this by 
fitting eq. (|18p to data. The price to pay is that the recovered value will depend on the adopted power spectrum model and on 
the set of parameters on which the power spectrum itself depends, i.e. the reduced Hubble constant h, the primordial spectral 
index n s , and the reduced density of baryons (fib). This approach, however offer some advantage: if the relevant cosmological 
parameters fl m , flA and h are considered known from independent probes, then we can extract information about the purely 
gravitational sector of the theory. 

The way we proceed is as follows: we assume standard gravity as described by GR, we fra me our analysis in the li near 
regime, i.e. we adopt the phenomenological description of the matter power spectrum given by Eisenstein fc Hu ( 19981 ). as 



well as the time evolutionary model for or given in eq. (|18|l , and we look for the the best fitting parameter n 3 , fib and as (0) 
that minimize the statistical distance between our measurements (cfr. eq. (|45|ll and theoretical predictions (cfr. eq. ((THJ) . The 
outcome of this approach is displayed in Figure [8] 

If we assume that n s and fib are fixed to the simulation's values (n s = 0.96, fib = 0.044), we obtain ag(0) = 0.79 ± 0.08. 
This best fitting value is in perfect agreement with the simulated one (os = 0.79). Results obtained by performing a joint 
two-parameter analysis (after fixing the third parameter to the simulated value) are shown in Figure [9] The left and central 
panel of this figure reveal that os(0) is only marginally degenerate with respect to both n s and fib, a fact that highlights the 
fundamental inefficiency of our probe in constraining the values of both these parameters. This conclusion is reinforced in 
the right panel of the same figure, which displays a strong degeneracy between n s and fib together with a loosely constrained 
confidence region in the corresponding parameters plane. Luckily, this means that the uncertainties with which both these 
parameters are estimated using more sensible probes, do not critically affect the precision with which our method constrain 
the amplitude of o%(0). 

Despite the fact that our analysis was performed by slicing the survey volume in independent redshift shells, the limited 
redshift interval explored allows us to fix only the local amplitude of the linear matter fluctuation field. In a future work we 
will show that by implementing the method with deeper mock catalogs simulating the region of space that will be surveyed 
spectroscopically by surveys like BigBOSS and EUCLID, one can further aim at constraining the time evolution of cjr. Data 
on a larger redshift interval will allow to constrain not o nly os(0), but also the growth index 7 in terms of which the growth 
rate is usually parameterized (/ = f!J,(z) Peebles ( 198Ch ). This will allow to reject possible alternative description of gravity, 



or, in turn the standard model of gravitation itself. 



6.5 Consistency tests 

We have shown that, by using simulations, it is pretty straightforward to assess whether the proposed measuring strategy is 
able to recover the underlying value of or. What if, instead, a real redshift survey is considered? Are there specific physical 
criteria or statistical indicators that guarantee us that the recovered value of or is the true one? In other terms we want to 
shift our attention from the precision of the estimates to their accuracy. Apart from the unbiasedness of the WNLPT results 
in the LS limit, our test strategy strongly relies on assuming that the correct set of cosmological parameters (flu, ^a) has 
been used in the analysis. As a consequence, any imprecision in the measurements of the reduced cosmic densities translates 
into a biased estimate of or. In this section, we design a diagnostic scheme to test the coherence of our results and, at the 
same time, the soundness of the adopted values for Qm and flA- 
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Figure 7. The real-space rms of the matter fluctuation field is shown as a function of redshift. Black filled points represent estimates 
obtained by implementing eq. (14 "it to the LRGs mock catalogs. Solid lines show the theoretical prediction obtained by inserting all 
the physical and cosmological parameters of the Horizon simulation into eq. H81 . Count-in-cell and cell correlation analyses have been 
performed by assuming spherical cells of radius R separated by correlation lenght r = nR. We present results obtained for n = 2 (left 
panels), n = 3 (central panels) and n = 4 (right panels) using two typical cell sizes R = 17/i — 1 Mpc (upper panels) and R = 20h~ 1 Mpc 
(lower panels). Each point is the average of the results obtained from eight independent full-sky LRGs catalogs containing nearly 3.8 
million galaxies each. Errorbars are estimated as the standard error of the mean. At the mean redshift of the catalogs (z = 0.34) we also 
display the average estimate of er^j obtained in the whole survey volume (unfilled blue points). 
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Figure 8. The real-space rms of the linear matter density fu^^uaiions^on a scale R = 17/i _1 Mpc is shown as a function of redshift. 
The same data as in the upper central panel of Figure [7] are used. The solid curve represents the best fitting theoretical model for the 
linear scaling of an (cfr. eq. 1181 obtained after fixing the amplitude of f2(,, n s and h to the values of the Horizon simulation. The 
corresponding best fitting value of o"g(0) is shown in the inset together with its standard deviation. 
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Figure 9. The likelihood contours obtained from the joint estimation of (ag(0),n B ), (crg(0), Qi,) and (Qf,,n s ). The analysis have been 
performed calculating reduced correlators on a scale R = 17h _1 Mpc and at the correlation lenght n = 3. Isocontours of differently 
colored region corresponds to C m i n + 2.30 and C m i„ + 4.61 where C is proportional to the log of the likelihood of the data and it is here 
assumed to be affine to the x 2 statistic. On each panel the filled points represent the fiducial value of the Horizon simulation. 



In the approach developed in this paper, the linear biasing parameter in real space is directly estimated from redshift 
space observables of intrinsic third-order nature using the estimator 

bi, R = (52) 
T g,R 

As discussed in Section §5, this estimator has the remarkable property of being approximately independent from cosmology. 

Now let's define two new estimators of the real space linear biasing parameter as bi : R = Og^jaR and bi t R = £, g ,R/(,n, 
where now we exploit, as it is usual, second order statistics. Both <7 9i n and are quantities not directly measurable, 
nonetheless we can recast the expressions of the real space linear biasing parameters in terms of redshift space observables. 
By adopting the Kaiser model for linear motions we obtain 



k " ' -i + \fWf-^* (53) 
- -i + \f¥ 7 ¥'' <54) 

Before proceeding further, note that in this paper we have assumed that the Kaiser linear modeling of redshift space 
distortions applies on the scales we are interested in. We can now verify this statement by using the LRGs synthetic catalogs. 
To this purpose, and without loss of generality, be bi t R the value of the real-space linear bias parameter estimated from the 
mock catalogs as y/^ g> R/^ R . Let's refer to this estimate as to the true value of the linear-bias parameter and let's label it as 
b\ h . In figure [10] we compare these measurements against the estimates of the real-space linear bias inferred using eq. (|54|l 
in three different redshift intervals and for various smoothing scales R. One can see that for n = 3, the estimator (|54|l fairly 
recovers the real-space value of the linear bias parameter. This result lends support to the hypothesis that, at least in the 
Horizon simulations, and over the range of R scales where we trust the correlator's theory, the distortions induced by large 
scale peculiar motions are accurately described by the Kaiser model. 

Physically, we expect that, whatever is the chosen scale R, if o$(0) has been consistently determined, then eqs. (|52[) , (|53[l 
and (I54|l all give the same numerical result. Since the linear biasing estimators given in eqs. (|53[) an d (|54[) depends on the 
chosen background cosmology, the correct set of cosmological parameters is thus the very one that makes all the three different 
linear biasing definitions converge to the same numerical value on all the scale R. In a different paper (Bel & Marinoni in 
prep) we show how this observation can be exploited to guess the background cosmology. In this paper, we use this property 
to gauge the consistency of our measurements of (jr, that is to verify that all the different estimators of the linear biasing 
parameters match only when the analysis is carried out in the proper cosmological background. 

In Figure |TT| we perform this test and show what pathological features do show up when the analysis relies on an 
improperly chose set of values (Qm,£Ia)- The algorithm goes as follows: we estimate or on a given arbitrary scale R (here 
we chose R = 16/i -1 Mpc), and in four different cosmologies (indicated in each panel of Figure [TTj). We then deduce the value 
of o"8(0) in each of these four scenarios. In doing this, we implicitly assume that the specific cosmology adopted in order to 
measure o\r=i6 is the correct one and that, as a consequence, the value of (78(0) inferred using any other scale R is identically 
the same. We then plug in this value of (7s(0) into eqs. eqs. (|53[) and (|54l) and compare the results with those obtained via 
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Figure 10. Black triangles show the R dependence of the real-space linear bias parameter i>i r estimated from redshift-space mock 
catalogs using eq. (I54D . Measurements are performed using the correlation length n = 3 and in three different redshift bins. These 
measurements are compared to the real-s[ace linear bias parameter b^ 1 extracted from real-space mock catalogs using the estimator 
■\J £g,n/£n (dotted line). The shaded area represents the region of 1-cr uncertainty. 



the estimator 6i,h, i.e. we contrast them against a measurement of the linear biasing parameter that is weakly sensitive to 
cosmology. 

If we analyze the LRGs mock catalogs by assuming the simulated set of cosmological parameters (f!jj = 0.26, £Ia = 0.74, 
see upper left panel of Figure ITT]) then the estimates of bi,R and bi,R are consistent between themselves on all the scales 
R. On the contrary, if we process data by incorrectly assuming a low density (open) background model (upper right panel 
in Figure [TTjl . the different estimations of the linear biasing parameter are not anymore in agreement, even if the estimated 
value of (78(0) has not changed (note that the linear power spectrum at present epoch is independent from the amplitude of 
the cosmological constant and thus insensitive to its variation). In particular, bi t R identically coincides, by definition, with the 
measure obtained using our estimator b\ t R only for R = 16ft _1 Mpc , but the estimates deviate on all the other scales. This 
discrepancy is due to the fact that, in a "wrong" cosmology, the extrapolated value of <ts(0) may not be independent from 
the scale R on which or is measured. Or said differently, the equation 6i,fl(crg(0)) = b\,R might not have a unique solution 
crg(0) for all the scales R. 

The effect previously discussed is mostly the consequence of adopting the wrong amplitude for the cosmological constant. 
It is, however, less pronounced than the discrepancy between the measurements of b\ t R and bi,R that arises when the value 
of Q m is poorly guessed. The effects of a wrong choice of the matter density parameter are presented in the lower panels 
of Figure 1111 The observed large inconsistency arises essentially from the fact that the zero order spherical Bessel function 
appearing inside the integral in eq. (|21|) filters in different portions of the signal (i.e. of Afj if the characteristic parameters of 
the linear power spectrum are changed. In other terms, if we consider eq. ()21[) and spuriously overestimate fi m , the predicted 
suppression of power on a scales r is larger than the variation actually seen in the data. 

We remark that our estimator of the linear biasing parameter, relying on third-order statistical indicators, is affected by 
errorbars that are larger than those associated to the classical estimators bi.R and bi.R. This imprecision is largely compensated 
by the fact that our estimator does not depend on any assumption about the nature of the dark matter (i.e. the specific form 
of the matter power spectrum) and it is almost insensitive also to its abundance (f2jvf). Therefore, by contrasting different 
estimations of the linear biasing parameter, using the diagnostic diagram of Figure ITTT we can deduce if the scaling of <jr was 
inferred in the appropriate cosmological model. 
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Figure 11. Comparison between the values of the linear bias paramefllr^on various scales -R. Measurements are obtained using 3 different 
estimators (cfr. eqs. J52}i A53|l . and l|54|l ) and by analyzing the same redshift data in 4 different cosmological backgrounds as specified in 
the panels. b\ n and bi n are estimated on the basis of the best fitting value of crg(0) indicated in each panel. This last value has been 
obtained from measurements of ur on a scale R = 16h~ x Mpc and by assuming, arbitrarily, that the linear power spectrum parameters 
are subject to the following constraints: h = 0.72 and f2j,/f2 m = 0.17 for all the cosmological models. Note that in the lower panel, bi r 
data are missing on a scales R = (19, 20)/i _1 Mpc. This is due to the fact that the argument of yj ( g ,R/^R is negative, i.e. in these extreme 
cosmological models, the predicted correlation function of the matter density field becomes negative on scales where the corresponding 
galaxy statistic is still positive. The horizontal dotted line shows the average of the measurements obtained with our estimators bi n in 
the true cosmological model (i.e. the ACDM model of the Horizon simulation) and it is reproduced identically in all the panels. This 
line graphically helps to highlight the relative insensitivity of our estimator bi n to the choice of the parameters (fijvf , £Ia)- Note how 
different estimates converge to the same value only when the analysis is performed in the true cosmological model. 



CONCLUSIONS 

A key goal that seems to fall within the technical reach of future cosmological experiments is to rule out (or in) eventual 
infrared modifications of the standard theory of gravity, as possibly manifested by the unexpected growth of cosmic mass in 
the weakly non-linear, low-curvature, high-redshift regimes. To fulfill this task, it is mandatory to devise sensible observables 
of the large scale structure of the universe. In this spirit, this paper focuses on the potential of clustering indicators that have 
been rarely explored in literature, that is the high-order reduced correlators C„ m of the 3D mass overdensity field. 

To fully exploit the richness of information contained in this two-point statistics, whose amplitude is analytically predicted 
by the weakly non-linear perturbation theory in the large separation limit approximation, we have derived the expressions of 
the reduced correlators of the smoothed galaxy density field (C nm ,g) up to order 5. We have found that they preserve both 
the hierarchical scaling and the factorization properties of the matter reduced correlators. 

Building upon these results we have worked out the explicit expressions for the bias coefficients up to order 4 and a new 
estimator to measure the rms of the linear matter fluctuations on a scale R directly from galaxy redshift surveys. The central 
result of this paper, namely the estimator given in eq. (|45[1 . has been tested using artificial galaxy catalogs and shown to 
recover fairly well the 'hidden' simulated value of or. 

Despite the fact that very large survey volume are needed to make the estimation of these observables accurate enough 
for cosmological purposes, the merit of this approach are evident: a) linear biasing is not a parameter that one needs to 
marginalize over, but a physical parameter that can be estimated in a totally independently way from any assumption about 
the structure of the linear power spectrum and the value of cosmological parameters, b) the real space linear biasing parameter 
can be measured directly using redshift space observables. c) The scaling of or can be inferred directly without imposing any 
a-priori constraint on the eventual non-linear and scale dependent nature of the bias function, d) The correlator formalism 
allows also for a self-consistent test of the coherence of the results obtained concerning the scaling of or, a step further in the 
direction of making cosmological results not only precise but also accurate. 

In this paper we have analyzed local simulations with the aim of testing principles and theoretical ingredients on which 
the proposed strategy relies. Work is already in progress to apply the formalism to the SDSS-dr7 data and to extract the local 
value of linear matter fluctuations on sensible scales R. The good news is that the next decade holds even greater prospects 
for growth of the red-shifts data base. Therefore, we also plan to implement the algorithm to mock catalogs simulating future 
large 3D surveys such as BigBOSS and EUCLID and forecast up to what order and precision the bias coefficients bi can be 
estimated, as well as, the figure of merit achievable on as and on the gravitational growth index 7. 
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From the theoretical side, valuable insights are expected from a reverse engineering on the proposed test. For example, 
we show in a different paper (Bel & Marinoni in prep.) that if the gravitational model is a-priori known then the formalism 
offers the possibility of narrowing in on the value of fundamental cosmological parameters such as fl m and S1a- Also further 
work is needed to understand how higher order real-space biasing parameters can be effectively retrieved from redshift space 
reduced correlators. Finally, interesting possibilities will open up if WNLPT predictions could be extended into the small 
separation limit where much more statistical power is locked. 

CM is grateful for support from specific project funding of the Institut Universitaire de France. We thank the anonymous 
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APPENDIX A: APPENDIX A: HIGHER ORDER GALAXY TWO-POINT CUMULANT MOMENTS 

Listed here are the amplitudes of the two-point galaxy cumulant moments. Correlators of the galaxy distribution have been 
computed up to order 5. 

2 2 / \ /222 

K 9 ,ll = f>l (c 3 + C12C2JK11 K2 + 1/2 61 C 2 Kn 

Kg, 12 = 6i 3 (Ci2 + 2c2)kii K2 + 61 3 (5/2 C3C12 + 2c2 2 Ci2 + 3C3C2 + C4 + 1/2 C2C22 + C2 2 53 + C3S3 + £2613)^11 K2 2 

+bl 3 (c 2 3 + l/2c 4 + 3c 2 2 Cl2 + 2c 3 C 2 + C3Cl2)Kll 2 K2 + f>i 3 C3C 2 /-cii 3 
K 9 ,13 = &1 4 (6C2 2 + C13 + 3C2S3 + 3C3 + 6C2Cl2)Kll ^ + &l 4 (3Cl2C2 + 6 C2 2 ) Kll 2 ft2 + frl^Kl]. 3 + ftl* (3/2 C2C14 
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